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1, INTRODUCTION 

Optical Character Recognition (OCR) is a branch of pattern recognition and artificial intelligence. It 
contributes enormously to human-machine communication. It is the computational process of converting 
images of printed or handwritten text into machine-readable text that is easily reproducible by a computer 
system (such as Microsoft Word). OCR is more specifically concerned with forms of information acquired 
from a printed paper document, such as invoices, computerized receipts, passport documents, bank statements, 
letters, books or any appropriate documentation. The realization of such a process is very complicated because 
there is an infinity of representations of writing. Indeed, each person has a unique writing style and there are 
many styles and fonts in printed characters. Thus, character recognition systems are adapted to the type of 
writing envisaged (printed, manuscript or cursive). 

Thanks to recent advances in computer science, many techniques of handwriting recognition have also 
been improved, especially for Latin and Arabic writing [1, 2]. Thus, in recent years, with the growth of the 
means of communication, other alphabets such as the Tifinagh alphabet of the Amazigh language have 
integrated in the information systems. This has led to the appearance of other types of documents where writing 
is not yet handled and therefore more difficult to recognize. Text recognition of such documents requires more 
specific processing techniques. 

In this paper, the aim is to apply an OCR system on the Amazigh handwriting (Tifinagh). The 
Amazigh language is spoken today by a large number of populations all over the world, mainly in the North 
Africa. It is normally written from left to right and vertically from top to bottom. The Amazigh writing is non- 
cursive which simplifies the segmentatiFon of characters in a text image. Figure | illustrates the Tifinagh 
characters adopted by IRCAM. 
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Comparing with Roman, Arabic or Chinese, research on the recognition of Amazigh writing has not 
reached perfection. As far as authors know, very few attempts have been made on the recognition of Amazigh 
handwriting. Amrouch et al. [3] described a global approach for the recognition of handwritten Tifinagh 
character in which they used Hough transform as feature extraction method and Hidden Markov Models 
(HMM) for classification. The system produced promising result, however the discrimination of this model is 
not good enough due to learning step when a single image is used for every character. El Ayachi et al. [4] 
presented a solution of rotation and noise problems by using dynamic programming, invariant moments and 
Walsh transform. The system produced good results however combining multiple descriptors make the system 
slower. El Kessab et al. [5] combined Multi-Layer Perceptron and HMM for more discriminative system which 
proven to be reliable in term of recognition rate but slower in term of CPU time. Gounane et al. [6] proposed 
a hybrid approach by combining neural networks and b-gram. The system produced satisfactory results, 
however recognition rate can be improved using bigger corpus. Es-saady et al. [7] presented a system where 
the horizontal and vertical baseline positions are the features of the character. The system showed better results 
when baselines are replaced by centerline. 


K 
i 
X 
V 
z 
I 
ul 
L 


oo 


pie eae (EE [pal SS) 1) 6) AE 


O 





Figure 1. Tifinagh characters 


Comparing with English, Arabic or Chinese, research on the recognition of Amazigh writing has not 
reached perfection. As far as authors know, very few attempts have been made on the recognition of Amazigh 
handwriting. Amrouch et al. [3] described a global approach for the recognition of handwritten Tifinagh 
character in which they used Hough transform as feature extraction method and Hidden Markov Models 
(HMM) for classification. The system produced promising result, however the discrimination of this model is 
not good enough due to learning step when a single image is used for every character. El Ayachi et al. [4] 
presented a solution of rotation and noise problems by using dynamic programming, invariant moments and 
Walsh transform. The system produced good results however combining multiple descriptors make the system 
slower. El Kessab et al. [5] combined Multi-Layer Perceptron and HMM for more discriminative system which 
proven to be reliable in term of recognition rate but slower in term of CPU time. Gounane et al. [6] proposed 
a hybrid approach by combining neural networks and b-gram. The system produced satisfactory results, 
however recognition rate can be improved using bigger corpus. Es-saady et al. [7] presented a system where 
the horizontal and vertical baseline positions are the features of the character. The system showed better results 
when baselines are replaced by centerline. 

To summarize, all research proposed for the recognition of Amazigh handwriting used statistical 
approach, where features are represented as a dimensional point in the corresponding vector space, which allow 
the use of vector space properties to resolve classification problems. 

In this work, a handwritten character recognition system based on a structural approach is proposed. 
In this approach the patterns are devised into several parts then represented by a graph. This graph describes 
the properties and spatial position of those parts including their relationship between each other. 

This paper is organized as follows: in section 2, the preprocessing phase is described. Section 3 deals 
with the structural features extraction phase. Section 4 deals with the classification phase and finally, in the 
last section experimental results are discussed. 
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2. PREPROCESSING 

The purpose of preprocessing is to produce a clean version of the input image by keeping relevant 
information which differentiate the pattern in the feature space. 

In this step, The RGB image is converted into a binary image. In fact, the well-known Otsu Method 
[13] 1s applied to the Red, Green and blue channels of the input image. Gathering those three monochrome 
images gives us a binary image with the maximum relevant information of the pattern. This step allowed to 
remove low to medium level of noise from the image. 

Based on the monochrome version of the input image, the unwanted areas are removed from the 
image. It consists of scanning the vertical histogram horizontally and the horizontal histogram vertically to find 
character borders. The space between those borders and the image borders 1s removed. 

To simplify the extraction structural feature; the normalized monochrome image is thinned using 
Zhang-Suen algorithm [8]. For a better thinning results, this algorithm is used as follow: 

1. Apply thinning on connected components instead of applying it to the entire image. This allows us to treat 
the case of removed components. 
2. Smooth the skeleton by eliminating pixels that do not affect the connectivity and topology of the character 

Figure 2 summarize the steps of preprocessing of a handwritten Tifinagh character. 





Figure 2. Thinning result example: (a) Standard thinning Algorithm, (b) Enhanced thinning algorithm 


Figure 3, illustrates the execution time of the process of pre-processing of the Tifinagh handwritten 
characters and those of the printed characters used in previous work [11]. The preprocessing time of 
handwriting is obviously greater than that of the printed writing because the latter requires much more 
preprocessing. 
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Figure 3. Pre-processing time of handwritten and printed characters 
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3. STRUCTURAL FEATURE EXTRACTION 

The structural descriptions consist of extracting the topological and geometrical properties of the 
character with a high tolerance to distortions and variations of style. This type of representation can also encode 
some knowledge about the structure of the object or can provide some knowledge about the components 
constituting this object. 

Graph representation is one of the most adequate ways to encode the structure information of a pattern 
(in our case Tifinagh character). It is a formal mathematical representation of a set of objects and their 
relationships. Each object is called vertices. The relationships between objects are called edges. More formally, 
a graph G is defined as an ordered pair G = (V,E) where V is a set of vertices and E is a set of edges that 
define the connectivity between a pair of vertices. 

The adjacency matrix is a way to represent the graphs. It is a binary square matrix AM where the 
number of vertices | V | is its size. The entry in row i and column j is nonzero if and only if the edge (i, J) is in 
the graph which means: 

AM(i,j) =1, anedge is connecting vertice i and j \ 
a (i,j) = 0, tandj are not connected (1) 


In this case, authors worked on undirected graphs whose edges direction is not important. This 
representation is fast and compact. However, there is a redundancy of information since the matrix is 
symmetric. 

To represent Tifinagh characters by graphs, key points are extracted first. They will serve as delimiter 
for character segmentation into line segments and nodes in graph representation. In a perfect character skeleton, 
a foreground pixel is considered as a key point by looking to the number of foreground pixels in its 
neighborhood as shown in Figure 4. If the number of neighbors is one then that pixel is considered as an end 
point (as shown in Figure 4.a). If the number of neighbors is more than two then that pixel is an intersection 
point (see Figure 4.b). However, a skeleton is rarely perfect no matter what thinning algorithm is used, two or 
more-pixel width part are often found in the skeleton. To remedy to this issue, the number of the transitions 
from a background pixel to a foreground pixel is considered instead of the number of foreground pixels (see 
Figure 4.c). Which means that a pixel is classified as a primary key point if the number of transitions is one or 
more than two. 





Figure 4. Example of pixel neighborhood, (a): end point, (b): intersection point, (c): none key point pixel 
since the number of transactions from white pixel to black pixel is two 


End and intersection points are not enough to divide the character skeleton into several line segments. 
Secondary key points are needed between segments that are curvier then straight. To do so, an algorithm is 
proposed which is similar to the algorithm authors proposed in [14] that allows to divide a curvy segment into 
several line segments. It is based on the fact that the smallest distance between two points (in our case those 
two points are the end of segments extracted based on end and intersection points) 1s the straight line connecting 
them. 

The algorithm starts by extracting segments between primary key points (end and intersection 
points). Every segment is classified into a straight line or curvy line. This is done using a threshold which is 
the length of segment D divided by the Euclidian distance d its end. The value of the threshold, calculated 
empirically, is 0.2. The example for “yah” character key points extraction using authors proposed algorithm 
shown in Figure 5. 
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Figure 5. “yah” character key points extraction using authors proposed algorithm 


When an arc is detected, the orthogonal distance between the segment elements and the straight line 
is calculated. The element that has the maximum perpendicular distance is designated as an inflection point. 
Every time a point is added to a segment, two new segments replace the former segment in the segment list. 
The algorithm used for the extraction of the key points is as follows: 

. Extract primary key points. 

. Extract segments based on these points. 

. For every segments, checks if it is an arc. 

. For every arc, make the point that have the biggest orthogonal distance as a key point. 

Divide arc segment into two segments where the new key points become an end of the new segments. 
. Update the list of segments. 

. Repeat step 3 to 6 until no arc segment is detected. 

To evaluate the performance of the proposed algorithm, for each Tifinagh handwritten character, the 
number of key points extracted by this algorithm is compared to that extracted by the well-known Harris 
algorithm [15] and the optimal number of points (calculated empirically) necessary for the graphical 
representation; and on the other hand, the execution times of these algorithms. This comparison, illustrated in 
Figure 6 and 7, shows that our algorithm is more efficient than Harris in terms of the speed and the extraction 
of the key points. 
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Figure 6. Comparison of the number of points obtained by the Harris method, the proposed algorithm and 
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Figure 7. Execution time of the Harris algorithm and the proposed algorithm, case of handwritten Tifinagh 


characters 


Now that the character skeleton is divided into several line segments and key points, a graphical 
representation can be applied where the nodes represent the key points and edges represent segments 
connecting these points in pairs. The graph construction algorithm is proceeded as follows: 


Let AM be ann X n adjacency matrix where n is the number of key points. 
For every segment, four information are extracted: first key point, second key point, segment orientation O, 
segment length L. 


Based on this information, the adjacency matrix is constructed as follow: 
AM(i,j) = w, key point i is connected to key point j 
Coane = 0, else (3) 
Where, 
w=2xL+0 (4) 
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This representation is faster than in the case of the use of key points extracted by Harris. This is 
illustrated in Figure 8, which gives the execution time of the algorithm of the graphical representation for each 
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Figure 8. Execution time of the graph representation algorithm characters per key points extracted by the 
algorithm and Harris algorithm 


4. CLASSIFICATION 

The used spectral method [9] aims at finding a coherent agreement between two graphs. This is done 
by calculating Eigen values and Eigen vectors of the graph product (or Similarity matrix) of the two graphs. 
the biggest Eigen value of the graph product designs the degree of similarity between the two graphs. Readers 
are invited to check the original paper that is cited for more details. 


5. EXPERIMENTAL RESULTS 

To evaluate the overall performance of the proposed system, authors used two database sets of 
handwritten Tifinagh characters. The first database [10] 1s composed of 1376 images; 43 images for each 
character. It was created with the help of 30 people and other 13 to take in consideration inclination issues. The 
second database [12], is composed of 25740 images; 780 images for each character. It was created with the 
help of 60 person with deferent ages. During tests, 80% (1100 images for the first database and 20592 images 
for the second database) of these databases are used as reference images and 20% (276 images for the first 
database and 5148 images for the second database) are used as test images. These experiments are done using 
a laptop with 2.6 GHz speed 17 dual core processor, 8 GB RAM and MATLAB software. 

The recognition rate results obtained are illustrated in Table 1. A recognition rate of 94% is obtained 
on the first database and 85% for the second database. The decreased recognition rate from the first to the 
second database 1s due to the increasing of badly written characters, the increasing of noise level or the loss of 
important information in the acquisition phase. 


Table 1. comparison of recognition and error rate of the proposed system with other systems 


First database [10] Second database [12] Local Database 
Authors proposed Recognition rate (%) 94 85 - 
system Error rate (%) 6 15 - 
Recognition rate (%) - - 90.4 
a Error rate (%) - - 9.6 
Recognition rate (%) - - 923 
El Kessab et al. [5] Error rate(%) ; ; 77 
Recognition rate (%) - 94.96 - 
pene ever Error rate (%) - 5.04 - 
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6. CONCLUSION 

In this paper, recently, proposed Optical Character Recognition (OCR) system is applied to 
handwritten Tifinagh characters. The main contribution of this paper is the application of a novel key point 
extraction technique and graph matching on handwritten Tifinagh characters. This technique showed great 
results in term of pattern description and speed compared to the well-known Harris method. Experimentation 
on two different databases proved the good performance of the system. However, many enhancements are 
required, especially in the classification process. A better classification technique will be proposed in the future 
work. 
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